Overview

Dataset statistics

Number of variables: 26
Number of observations: 10155
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 MiB
Average record size in memory: 208.0 B

Variable types

NUM: 19
BOOL: 4
CAT: 3

Reproduction

Analysis started: 2020-06-12 11:46:00.892300
Analysis finished: 2020-06-12 11:48:10.511495
Duration: 2 minutes and 9.62 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Portfolio Balance is highly correlated with Investment in Derivative [High correlation]
Investment in Derivative is highly correlated with Portfolio Balance [High correlation]
Personal Loan is highly skewed (γ1 = 23.11911535) [Skewed]
Online Purchase Amount is highly skewed (γ1 = 20.3857396) [Skewed]
REF_NO has unique values [Unique]
children has 6208 (61.1%) zeros [Zeros]
Average Credit Card Transaction has 6190 (61.0%) zeros [Zeros]
Balance Transfer has 4392 (43.2%) zeros [Zeros]
Term Deposit has 5702 (56.1%) zeros [Zeros]
Life Insurance has 3061 (30.1%) zeros [Zeros]
Medical Insurance has 5033 (49.6%) zeros [Zeros]
Average A/C Balance has 3482 (34.3%) zeros [Zeros]
Personal Loan has 6382 (62.8%) zeros [Zeros]
Investment in Mutual Fund has 3257 (32.1%) zeros [Zeros]
Investment Tax Saving Bond has 6373 (62.8%) zeros [Zeros]
Home Loan has 6976 (68.7%) zeros [Zeros]
Online Purchase Amount has 7081 (69.7%) zeros [Zeros]
Investment in Commudity has 1021 (10.1%) zeros [Zeros]
Investment in Equity has 1139 (11.2%) zeros [Zeros]
Investment in Derivative has 539 (5.3%) zeros [Zeros]

Variables

REF_NO
Real number (ℝ≥0)

UNIQUE

Distinct count: 10155
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5770.830822255047
Minimum: 1
Maximum: 11518
Zeros: 0
Zeros (%): 0.0%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 573.7
Q1: 2903.5
median: 5770
Q3: 8665.5
95-th percentile: 10935.3
Maximum: 11518
Range: 11517
Interquartile range (IQR): 5762

Descriptive statistics

Standard deviation: 3324.837813
Coefficient of variation (CV): 0.5761454314
Kurtosis: -1.198925004
Mean: 5770.830822
Median Absolute Deviation (MAD): 2882
Skewness: -0.00552343364
Sum: 58602787
Variance: 11054546.49
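Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
2047 | 1 | < 0.1%
9502 | 1 | < 0.1%
7465 | 1 | < 0.1%
5416 | 1 | < 0.1%
9510 | 1 | < 0.1%
3363 | 1 | < 0.1%
1314 | 1 | < 0.1%
7457 | 1 | < 0.1%
5408 | 1 | < 0.1%
3355 | 1 | < 0.1%
Other values (10145) | 10145 | 99.9%

Minimum 5 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
11518 | 1 | < 0.1%
11516 | 1 | < 0.1%
11514 | 1 | < 0.1%
11513 | 1 | < 0.1%
11512 | 1 | < 0.1%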
 

children
Real number (ℝ≥0)

ZEROS

Distinct count: 5
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6456917774495322
Minimum: 0
Maximum: 4
Zeros: 6208
Zeros (%): 61.1%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 1
95-th percentile: 2
Maximum: 4
Range: 4
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.9204737444
Coefficient of variation (CV): 1.425562128
Kurtosis: 0.2182512517
Mean: 0.6456917774
Median Absolute Deviation (MAD): 0
Skewness: 1.173637238
Sum: 6557
Variance: 0.847271914
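Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 6208 | 61.1%
1 | 1848 | 18.2%
2 | 1607 | 15.8%
3 | 473 | 4.7%
4 | 19 | 0.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 6208 | 61.1%
1 | 1848 | 18.2%
2 | 1607 | 15.8%
3 | 473 | 4.7%
4 | 19 | 0.2%

Maximum 5 values
Value | Count | Frequency (%)
4 | 19 | 0.2%
3 | 473 | 4.7%
2 | 1607 | 15.8%
1 | 1848 | 18.2%
0 | 6208 | 61.1%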
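
Because children takes only five distinct values, its summary statistics can be recomputed from the frequency table alone. The sketch below does so, assuming the report uses the sample variance (divisor n - 1) and the adjusted Fisher-Pearson skewness, which are the pandas defaults; that the report computes them exactly this way is an assumption, and the variable names are illustrative.

```python
import math

# Frequency table of `children` as reported above: value -> count.
counts = {0: 6208, 1: 1848, 2: 1607, 3: 473, 4: 19}

n = sum(counts.values())
mean = sum(v * c for v, c in counts.items()) / n

# Second and third central moments (population form, divisor n).
m2 = sum(c * (v - mean) ** 2 for v, c in counts.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in counts.items()) / n

# Assumed conventions: sample variance and bias-adjusted skewness.
variance = m2 * n / (n - 1)
skewness = (m3 / m2 ** 1.5) * math.sqrt(n * (n - 1)) / (n - 2)

# Share of zeros, as quoted in the Zeros warning.
zeros_pct = 100 * counts[0] / n

print(n, mean, variance, skewness, zeros_pct)
```

Under these assumptions the results agree with the reported mean, variance, skewness and zero share to the precision shown in the report.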
 

age_band
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
1 | 4666 | 45.9%
2 | 3578 | 35.2%
0 | 1446 | 14.2%
3 | 465 | 4.6%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

status
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
0 | 7757 | 76.4%
1 | 2398 | 23.6%
 

occupation
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
0 | 5534 | 54.5%
1 | 4621 | 45.5%
 

home_status
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
1 | 9522 | 93.8%
0 | 633 | 6.2%
 

family_income
Real number (ℝ≥0)

Distinct count: 13
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.611619891678977
Minimum: 4.0
Maximum: 35.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 4
5-th percentile: 6.25
Q1: 13.5
median: 23.5
Q3: 28.5
95-th percentile: 35
Maximum: 35
Range: 31
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 9.598150509
Coefficient of variation (CV): 0.4244786775
Kurtosis: -1.135966584
Mean: 22.61161989
Median Absolute Deviation (MAD): 8.5
Skewness: -0.1962260335
Sum: 229621
Variance: 92.12449319
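
Several of the figures above are derived from others in the same block. As a quick consistency check, the sketch below recomputes the range, IQR, coefficient of variation and variance of family_income from the reported minimum, maximum, quartiles, mean and standard deviation. The inputs are the rounded values printed in the report, so agreement is only expected up to rounding; the variable names are illustrative.

```python
# Values copied from the family_income statistics above.
minimum, maximum = 4.0, 35.0
q1, q3 = 13.5, 28.5
mean = 22.61161989
std = 9.598150509

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std / mean                   # CV = standard deviation / mean
variance = std ** 2               # Variance = standard deviation squared

print(value_range, iqr, cv, variance)
```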
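Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
35 | 2517 | 24.8%
26 | 1227 | 12.1%
28.5 | 994 | 9.8%
23.5 | 833 | 8.2%
18.5 | 683 | 6.7%
11 | 677 | 6.7%
16 | 634 | 6.2%
13.5 | 629 | 6.2%
21 | 590 | 5.8%
9 | 563 | 5.5%
Other values (3) | 808 | 8.0%

Minimum 5 values
Value | Count | Frequency (%)
4 | 278 | 2.7%
6.25 | 402 | 4.0%
9 | 563 | 5.5%
11 | 677 | 6.7%
13.5 | 629 | 6.2%

Maximum 5 values
Value | Count | Frequency (%)
35 | 2517 | 24.8%
28.5 | 994 | 9.8%
26 | 1227 | 12.1%
23.5 | 833 | 8.2%
21 | 590 | 5.8%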
 

self_employed
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
0 | 9436 | 92.9%
1 | 719 | 7.1%
 

year_last_moved
Real number (ℝ≥0)

Distinct count: 95
Unique (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1968.3763663220088
Minimum: 0
Maximum: 1999
Zeros: 84
Zeros (%): 0.8%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1961
Q1: 1978
median: 1988
Q3: 1994
95-th percentile: 1998
Maximum: 1999
Range: 1999
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 180.2022422
Coefficient of variation (CV): 0.09154867193
Kurtosis: 114.838814
Mean: 1968.376366
Median Absolute Deviation (MAD): 7
Skewness: -10.7823423
Sum: 19988862
Variance: 32472.84809
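Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
1997 | 680 | 6.7%
1996 | 654 | 6.4%
1994 | 542 | 5.3%
1998 | 538 | 5.3%
1995 | 492 | 4.8%
1988 | 423 | 4.2%
1993 | 406 | 4.0%
1986 | 387 | 3.8%
1992 | 372 | 3.7%
1987 | 356 | 3.5%
Other values (85) | 5305 | 52.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 84 | 0.8%
1901 | 2 | < 0.1%
1902 | 2 | < 0.1%
1903 | 1 | < 0.1%
1904 | 3 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1999 | 65 | 0.6%
1998 | 538 | 5.3%
1997 | 680 | 6.7%
1996 | 654 | 6.4%
1995 | 492 | 4.8%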
 

Average Credit Card Transaction
Real number (ℝ≥0)

ZEROS

Distinct count: 1411
Unique (%): 13.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23.44175677006401
Minimum: 0.0
Maximum: 662.26
Zeros: 6190
Zeros (%): 61.0%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 23.98
95-th percentile: 130.567
Maximum: 662.26
Range: 662.26
Interquartile range (IQR): 23.98

Descriptive statistics

Standard deviation: 50.87212745
Coefficient of variation (CV): 2.170149957
Kurtosis: 18.50617043
Mean: 23.44175677
Median Absolute Deviation (MAD): 0
Skewness: 3.619964534
Sum: 238051.04
Variance: 2587.973351
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 6190 | 61.0%
19.99 | 147 | 1.4%
9.99 | 108 | 1.1%
11.99 | 70 | 0.7%
24.99 | 60 | 0.6%
19.49 | 56 | 0.6%
15.99 | 55 | 0.5%
4.99 | 54 | 0.5%
14.99 | 54 | 0.5%
9.49 | 50 | 0.5%
Other values (1401) | 3311 | 32.6%

Minimum 5 values
Value | Count | Frequency (%)
0 | 6190 | 61.0%
0.01 | 45 | 0.4%
0.02 | 10 | 0.1%
0.03 | 1 | < 0.1%
0.04 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
662.26 | 1 | < 0.1%
592.36 | 1 | < 0.1%
571.74 | 1 | < 0.1%
565.36 | 1 | < 0.1%
481.36 | 1 | < 0.1%
 

Balance Transfer
Real number (ℝ≥0)

ZEROS

Distinct count: 2183
Unique (%): 21.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46.41775972427376
Minimum: 0.0
Maximum: 2951.76
Zeros: 4392
Zeros (%): 43.2%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 17.96
Q3: 65.385
95-th percentile: 188.96
Maximum: 2951.76
Range: 2951.76
Interquartile range (IQR): 65.385

Descriptive statistics

Standard deviation: 78.47760933
Coefficient of variation (CV): 1.690680675
Kurtosis: 193.3229914
Mean: 46.41775972
Median Absolute Deviation (MAD): 17.96
Skewness: 7.210658269
Sum: 471372.35
Variance: 6158.735167
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 4392 | 43.2%
0.01 | 189 | 1.9%
24.99 | 167 | 1.6%
29.99 | 148 | 1.5%
19.99 | 87 | 0.9%
34.99 | 77 | 0.8%
25.99 | 71 | 0.7%
0.51 | 59 | 0.6%
24.49 | 53 | 0.5%
44.99 | 52 | 0.5%
Other values (2173) | 4860 | 47.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 4392 | 43.2%
0.01 | 189 | 1.9%
0.02 | 47 | 0.5%
0.03 | 13 | 0.1%
0.04 | 2 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
2951.76 | 1 | < 0.1%
860.83 | 1 | < 0.1%
858.78 | 1 | < 0.1%
749.38 | 1 | < 0.1%
659.21 | 1 | < 0.1%
 

Term Deposit
Real number (ℝ≥0)

ZEROS

Distinct count: 1419
Unique (%): 14.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27.579851304775975
Minimum: 0.0
Maximum: 784.82
Zeros: 5702
Zeros (%): 56.1%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 34.99
95-th percentile: 126.926
Maximum: 784.82
Range: 784.82
Interquartile range (IQR): 34.99

Descriptive statistics

Standard deviation: 53.95254985
Coefficient of variation (CV): 1.956230629
Kurtosis: 26.12492154
Mean: 27.5798513
Median Absolute Deviation (MAD): 0
Skewness: 3.997495289
Sum: 280073.39
Variance: 2910.877636
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 5702 | 56.1%
24.99 | 158 | 1.6%
29.99 | 154 | 1.5%
19.99 | 121 | 1.2%
14.99 | 89 | 0.9%
34.99 | 84 | 0.8%
9.99 | 80 | 0.8%
0.01 | 74 | 0.7%
29.49 | 54 | 0.5%
24.49 | 54 | 0.5%
Other values (1409) | 3585 | 35.3%

Minimum 5 values
Value | Count | Frequency (%)
0 | 5702 | 56.1%
0.01 | 74 | 0.7%
0.02 | 14 | 0.1%
0.03 | 2 | < 0.1%
0.51 | 33 | 0.3%

Maximum 5 values
Value | Count | Frequency (%)
784.82 | 1 | < 0.1%
738.67 | 1 | < 0.1%
716.12 | 1 | < 0.1%
597.76 | 1 | < 0.1%
539.18 | 1 | < 0.1%
 

Life Insurance
Real number (ℝ≥0)

ZEROS

Distinct count: 3111
Unique (%): 30.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 66.24213096996553
Minimum: 0.0
Maximum: 2930.41
Zeros: 3061
Zeros (%): 30.1%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 31.98
Q3: 94.39
95-th percentile: 243.654
Maximum: 2930.41
Range: 2930.41
Interquartile range (IQR): 94.39

Descriptive statistics

Standard deviation: 95.54531646
Coefficient of variation (CV): 1.442364777
Kurtosis: 87.69084691
Mean: 66.24213097
Median Absolute Deviation (MAD): 31.98
Skewness: 4.910322111
Sum: 672688.84
Variance: 9128.907498
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 3061 | 30.1%
0.01 | 92 | 0.9%
27.99 | 69 | 0.7%
29.99 | 65 | 0.6%
24.99 | 55 | 0.5%
25.99 | 54 | 0.5%
17.99 | 53 | 0.5%
22.99 | 53 | 0.5%
19.99 | 51 | 0.5%
34.99 | 50 | 0.5%
Other values (3101) | 6552 | 64.5%

Minimum 5 values
Value | Count | Frequency (%)
0 | 3061 | 30.1%
0.01 | 92 | 0.9%
0.02 | 30 | 0.3%
0.03 | 7 | 0.1%
0.08 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
2930.41 | 1 | < 0.1%
1005.53 | 1 | < 0.1%
825.02 | 1 | < 0.1%
817.63 | 1 | < 0.1%
805.28 | 1 | < 0.1%
 

Medical Insurance
Real number (ℝ≥0)

ZEROS

Distinct count: 1589
Unique (%): 15.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.142050221565732
Minimum: 0.0
Maximum: 591.04
Zeros: 5033
Zeros (%): 49.6%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0.51
Q3: 27.47
95-th percentile: 82.439
Maximum: 591.04
Range: 591.04
Interquartile range (IQR): 27.47

Descriptive statistics

Standard deviation: 32.45185602
Coefficient of variation (CV): 1.695317672
Kurtosis: 20.0135372
Mean: 19.14205022
Median Absolute Deviation (MAD): 0.51
Skewness: 3.242102818
Sum: 194387.52
Variance: 1053.122959
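Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 5033 | 49.6%
9.99 | 214 | 2.1%
9.49 | 82 | 0.8%
10.99 | 69 | 0.7%
29.99 | 67 | 0.7%
7.49 | 65 | 0.6%
6.99 | 62 | 0.6%
19.99 | 61 | 0.6%
4.99 | 61 | 0.6%
19.98 | 50 | 0.5%
Other values (1579) | 4391 | 43.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 5033 | 49.6%
0.01 | 36 | 0.4%
0.02 | 6 | 0.1%
0.48 | 1 | < 0.1%
0.51 | 9 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
591.04 | 1 | < 0.1%
350.71 | 1 | < 0.1%
306.85 | 1 | < 0.1%
296.01 | 1 | < 0.1%
294.76 | 1 | < 0.1%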
 

Average A/C Balance
Real number (ℝ≥0)

ZEROS

Distinct count: 2223
Unique (%): 21.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32.084965041851305
Minimum: 0.0
Maximum: 626.24
Zeros: 3482
Zeros (%): 34.3%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 14.99
Q3: 46.48
95-th percentile: 120.342
Maximum: 626.24
Range: 626.24
Interquartile range (IQR): 46.48

Descriptive statistics

Standard deviation: 45.48661367
Coefficient of variation (CV): 1.41769248
Kurtosis: 13.9870297
Mean: 32.08496504
Median Absolute Deviation (MAD): 14.99
Skewness: 2.777490494
Sum: 325822.82
Variance: 2069.032023
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 3482 | 34.3%
4.99 | 144 | 1.4%
29.99 | 136 | 1.3%
11.99 | 106 | 1.0%
9.99 | 100 | 1.0%
14.99 | 91 | 0.9%
24.99 | 87 | 0.9%
34.99 | 63 | 0.6%
2.99 | 56 | 0.6%
3.49 | 52 | 0.5%
Other values (2213) | 5838 | 57.5%

Minimum 5 values
Value | Count | Frequency (%)
0 | 3482 | 34.3%
0.01 | 38 | 0.4%
0.02 | 4 | < 0.1%
0.03 | 1 | < 0.1%
0.05 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
626.24 | 1 | < 0.1%
616.17 | 1 | < 0.1%
415.14 | 1 | < 0.1%
410.71 | 1 | < 0.1%
402.6 | 1 | < 0.1%
 

Personal Loan
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 1760
Unique (%): 17.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.00600295420975
Minimum: 0.0
Maximum: 4905.93
Zeros: 6382
Zeros (%): 62.8%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 21.48
95-th percentile: 134.368
Maximum: 4905.93
Range: 4905.93
Interquartile range (IQR): 21.48

Descriptive statistics

Standard deviation: 84.27574321
Coefficient of variation (CV): 3.240626534
Kurtosis: 1149.240577
Mean: 26.00600295
Median Absolute Deviation (MAD): 0
Skewness: 23.11911535
Sum: 264090.96
Variance: 7102.400894
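Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 6382 | 62.8%
15.99 | 53 | 0.5%
14.99 | 42 | 0.4%
13.99 | 37 | 0.4%
6.99 | 37 | 0.4%
5.99 | 36 | 0.4%
11.99 | 33 | 0.3%
0.01 | 32 | 0.3%
8.99 | 31 | 0.3%
17.99 | 28 | 0.3%
Other values (1750) | 3444 | 33.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 6382 | 62.8%
0.01 | 32 | 0.3%
0.02 | 9 | 0.1%
0.51 | 7 | 0.1%
0.52 | 5 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
4905.93 | 1 | < 0.1%
1645.38 | 1 | < 0.1%
1309.08 | 1 | < 0.1%
1304.01 | 1 | < 0.1%
1280.2 | 1 | < 0.1%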
 

Investment in Mutual Fund
Real number (ℝ≥0)

ZEROS

Distinct count: 2470
Unique (%): 24.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.339697685869034
Minimum: 0.0
Maximum: 2561.27
Zeros: 3257
Zeros (%): 32.1%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 23.48
Q3: 59.555
95-th percentile: 150.903
Maximum: 2561.27
Range: 2561.27
Interquartile range (IQR): 59.555

Descriptive statistics

Standard deviation: 63.89889948
Coefficient of variation (CV): 1.509195931
Kurtosis: 248.7072951
Mean: 42.33969769
Median Absolute Deviation (MAD): 23.48
Skewness: 8.398389891
Sum: 429959.63
Variance: 4083.069355
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 3257 | 32.1%
11.99 | 136 | 1.3%
9.99 | 100 | 1.0%
13.99 | 87 | 0.9%
23.98 | 77 | 0.8%
19.98 | 66 | 0.6%
23.48 | 65 | 0.6%
11.49 | 59 | 0.6%
0.01 | 49 | 0.5%
17.99 | 47 | 0.5%
Other values (2460) | 6212 | 61.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 3257 | 32.1%
0.01 | 49 | 0.5%
0.02 | 18 | 0.2%
0.03 | 3 | < 0.1%
0.04 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
2561.27 | 1 | < 0.1%
765.03 | 1 | < 0.1%
648.54 | 1 | < 0.1%
646.39 | 1 | < 0.1%
633.89 | 1 | < 0.1%
 

Investment Tax Saving Bond
Real number (ℝ≥0)

ZEROS

Distinct count: 832
Unique (%): 8.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.112070901033974
Minimum: 0.0
Maximum: 156.87
Zeros: 6373
Zeros (%): 62.8%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 5.975
95-th percentile: 32.543
Maximum: 156.87
Range: 156.87
Interquartile range (IQR): 5.975

Descriptive statistics

Standard deviation: 12.83367452
Coefficient of variation (CV): 2.099726054
Kurtosis: 14.82400131
Mean: 6.112070901
Median Absolute Deviation (MAD): 0
Skewness: 3.234383223
Sum: 62068.08
Variance: 164.7032016
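Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 6373 | 62.8%
1 | 278 | 2.7%
2 | 128 | 1.3%
4.99 | 124 | 1.2%
9.99 | 104 | 1.0%
19.99 | 87 | 0.9%
4.49 | 84 | 0.8%
2.49 | 74 | 0.7%
2.99 | 65 | 0.6%
3 | 60 | 0.6%
Other values (822) | 2778 | 27.4%

Minimum 5 values
Value | Count | Frequency (%)
0 | 6373 | 62.8%
0.01 | 3 | < 0.1%
0.1 | 2 | < 0.1%
0.2 | 5 | < 0.1%
0.45 | 3 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
156.87 | 1 | < 0.1%
138.56 | 1 | < 0.1%
124.76 | 1 | < 0.1%
123.34 | 1 | < 0.1%
121.44 | 1 | < 0.1%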
 

Home Loan
Real number (ℝ≥0)

ZEROS

Distinct count: 884
Unique (%): 8.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.482001969473165
Minimum: 0.0
Maximum: 162.35
Zeros: 6976
Zeros (%): 68.7%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 4.49
95-th percentile: 24.46
Maximum: 162.35
Range: 162.35
Interquartile range (IQR): 4.49

Descriptive statistics

Standard deviation: 9.982640872
Coefficient of variation (CV): 2.227272754
Kurtosis: 24.83350195
Mean: 4.482001969
Median Absolute Deviation (MAD): 0
Skewness: 3.888050096
Sum: 45514.73
Variance: 99.65311877
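Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 6976 | 68.7%
4.99 | 121 | 1.2%
9.99 | 93 | 0.9%
4.49 | 87 | 0.9%
3.99 | 67 | 0.7%
3.49 | 61 | 0.6%
7.99 | 49 | 0.5%
2.99 | 48 | 0.5%
1.99 | 47 | 0.5%
5.99 | 46 | 0.5%
Other values (874) | 2560 | 25.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 6976 | 68.7%
0.01 | 9 | 0.1%
0.5 | 3 | < 0.1%
0.51 | 3 | < 0.1%
0.74 | 3 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
162.35 | 1 | < 0.1%
122.67 | 1 | < 0.1%
121.92 | 1 | < 0.1%
114.39 | 1 | < 0.1%
110.17 | 1 | < 0.1%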
 

Online Purchase Amount
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 1319
Unique (%): 13.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.162772033481044
Minimum: 0.0
Maximum: 4306.42
Zeros: 7081
Zeros (%): 69.7%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 7.98
95-th percentile: 93.453
Maximum: 4306.42
Range: 4306.42
Interquartile range (IQR): 7.98

Descriptive statistics

Standard deviation: 89.66626341
Coefficient of variation (CV): 4.679190634
Kurtosis: 708.8697201
Mean: 19.16277203
Median Absolute Deviation (MAD): 0
Skewness: 20.3857396
Sum: 194597.95
Variance: 8040.038795
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 7081 | 69.7%
3.99 | 61 | 0.6%
11.99 | 59 | 0.6%
14.99 | 41 | 0.4%
7.99 | 41 | 0.4%
4.49 | 39 | 0.4%
4.99 | 37 | 0.4%
2.99 | 36 | 0.4%
9.99 | 33 | 0.3%
19.99 | 29 | 0.3%
Other values (1309) | 2698 | 26.6%

Minimum 5 values
Value | Count | Frequency (%)
0 | 7081 | 69.7%
0.01 | 19 | 0.2%
0.02 | 5 | < 0.1%
0.04 | 1 | < 0.1%
0.05 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
4306.42 | 1 | < 0.1%
2808.8 | 1 | < 0.1%
2142.62 | 1 | < 0.1%
2033.85 | 1 | < 0.1%
1652.45 | 1 | < 0.1%
 

Revenue Grid
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
2 | 9069 | 89.3%
1 | 1086 | 10.7%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

gender
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 79.3 KiB

Value | Count | Frequency (%)
0 | 7634 | 75.2%
1 | 2486 | 24.5%
2 | 35 | 0.3%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Investment in Commudity
Real number (ℝ≥0)

ZEROS

Distinct count: 3558
Unique (%): 35.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.56488626292467
Minimum: 0.0
Maximum: 1231.09
Zeros: 1021
Zeros (%): 10.1%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 8.23
median: 23.98
Q3: 50.79
95-th percentile: 115.863
Maximum: 1231.09
Range: 1231.09
Interquartile range (IQR): 42.56

Descriptive statistics

Standard deviation: 42.27053025
Coefficient of variation (CV): 1.156041617
Kurtosis: 69.90895292
Mean: 36.56488626
Median Absolute Deviation (MAD): 18.69
Skewness: 4.299542009
Sum: 371316.42
Variance: 1786.797728
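Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 1021 | 10.1%
4 | 60 | 0.6%
5 | 55 | 0.5%
6 | 53 | 0.5%
2 | 45 | 0.4%
7 | 44 | 0.4%
3 | 34 | 0.3%
3.9 | 33 | 0.3%
3.2 | 32 | 0.3%
3.6 | 32 | 0.3%
Other values (3548) | 8746 | 86.1%

Minimum 5 values
Value | Count | Frequency (%)
0 | 1021 | 10.1%
0.01 | 6 | 0.1%
0.02 | 1 | < 0.1%
0.1 | 28 | 0.3%
0.12 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1231.09 | 1 | < 0.1%
412.96 | 1 | < 0.1%
385.24 | 1 | < 0.1%
384.17 | 1 | < 0.1%
373.3 | 1 | < 0.1%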
 

Investment in Equity
Real number (ℝ≥0)

ZEROS

Distinct count: 3250
Unique (%): 32.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.69869423929099
Minimum: 0.0
Maximum: 1279.1
Zeros: 1139
Zeros (%): 11.2%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4.67
median: 12.98
Q3: 28.3
95-th percentile: 69.756
Maximum: 1279.1
Range: 1279.1
Interquartile range (IQR): 23.63

Descriptive statistics

Standard deviation: 31.89384106
Coefficient of variation (CV): 1.469850706
Kurtosis: 285.6367969
Mean: 21.69869424
Median Absolute Deviation (MAD): 10.09
Skewness: 10.27230932
Sum: 220350.24
Variance: 1017.217097
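Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 1139 | 11.2%
3.33 | 72 | 0.7%
2 | 56 | 0.6%
5 | 53 | 0.5%
1.67 | 46 | 0.5%
2.5 | 43 | 0.4%
4 | 37 | 0.4%
6.66 | 36 | 0.4%
4.17 | 34 | 0.3%
0.83 | 32 | 0.3%
Other values (3240) | 8607 | 84.8%

Minimum 5 values
Value | Count | Frequency (%)
0 | 1139 | 11.2%
0.01 | 1 | < 0.1%
0.02 | 1 | < 0.1%
0.08 | 3 | < 0.1%
0.09 | 12 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1279.1 | 1 | < 0.1%
717.74 | 1 | < 0.1%
556.19 | 1 | < 0.1%
434.15 | 1 | < 0.1%
419.99 | 1 | < 0.1%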
 

Investment in Derivative
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count: 3796
Unique (%): 37.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.988646971935005
Minimum: 0.0
Maximum: 1771.16
Zeros: 539
Zeros (%): 5.3%
Memory size: 79.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 8.74
median: 21.34
Q3: 42.98
95-th percentile: 98.176
Maximum: 1771.16
Range: 1771.16
Interquartile range (IQR): 34.24

Descriptive statistics

Standard deviation: 39.10634725
Coefficient of variation (CV): 1.222507075
Kurtosis: 395.8226205
Mean: 31.98864697
Median Absolute Deviation (MAD): 15.09
Skewness: 10.81440814
Sum: 324844.71
Variance: 1529.306395
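Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 539 | 5.3%
3.33 | 56 | 0.6%
5 | 51 | 0.5%
2 | 39 | 0.4%
1.67 | 35 | 0.3%
4.91 | 33 | 0.3%
5.83 | 31 | 0.3%
2.5 | 30 | 0.3%
4.17 | 28 | 0.3%
6.58 | 28 | 0.3%
Other values (3786) | 9285 | 91.4%

Minimum 5 values
Value | Count | Frequency (%)
0 | 539 | 5.3%
0.01 | 2 | < 0.1%
0.09 | 13 | 0.1%
0.1 | 1 | < 0.1%
0.17 | 11 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1771.16 | 1 | < 0.1%
533.98 | 1 | < 0.1%
456.12 | 1 | < 0.1%
421.55 | 1 | < 0.1%
411.39 | 1 | < 0.1%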
 

Portfolio Balance
Real number (ℝ)

HIGH CORRELATION

Distinct count: 8317
Unique (%): 81.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 90.4602373215165
Minimum: -78.43
Maximum: 4283.56
Zeros: 0
Zeros (%): 0.0%
Memory size: 79.3 KiB

Quantile statistics

Minimum: -78.43
5-th percentile: -13.474
Q1: 26.605
median: 66.2
Q3: 125.935
95-th percentile: 273.305
Maximum: 4283.56
Range: 4361.99
Interquartile range (IQR): 99.33

Descriptive statistics

Standard deviation: 107.2654754
Coefficient of variation (CV): 1.185774861
Kurtosis: 236.9206806
Mean: 90.46023732
Median Absolute Deviation (MAD): 46.14
Skewness: 7.724671171
Sum: 918623.71
Variance: 11505.8822
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0.36 | 6 | 0.1%
53.26 | 5 | < 0.1%
57.9 | 5 | < 0.1%
30 | 5 | < 0.1%
71.45 | 5 | < 0.1%
47.75 | 5 | < 0.1%
67.79 | 5 | < 0.1%
118.52 | 4 | < 0.1%
77.14 | 4 | < 0.1%
77.41 | 4 | < 0.1%
Other values (8307) | 10107 | 99.5%

Minimum 5 values
Value | Count | Frequency (%)
-78.43 | 1 | < 0.1%
-77.23 | 1 | < 0.1%
-76.35 | 1 | < 0.1%
-74.07 | 1 | < 0.1%
-73.35 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
4283.56 | 1 | < 0.1%
1109.57 | 1 | < 0.1%
1097.44 | 1 | < 0.1%
1053.8 | 1 | < 0.1%
1024.68 | 1 | < 0.1%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
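
The definition above can be sketched in plain Python. This is a minimal illustration, not the report's own implementation; the function name `pearson_r` is hypothetical. The 1/n factors of the covariance and of the two standard deviations cancel, so they are omitted.

```python
import math


def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long samples with at least two points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Unnormalised covariance and variances; the common 1/n factor cancels.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


xs = [1.0, 2.0, 4.0, 7.0]
ys = [3.0, 1.0, 6.0, 8.0]
r = pearson_r(xs, ys)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(xs, [3.0 * y + 7.0 for y in ys])
```

A constant input has zero variance, in which case this sketch raises ZeroDivisionError rather than returning a value.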

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
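
As a sketch of this recipe, the code below ranks both variables and then applies the Pearson formula to the ranks. Giving tied values the average of their positions is an assumption here (it is the usual convention), and the helper names are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]       # monotonic but nonlinear
rho = spearman_rho(xs, ys)      # perfect monotonic association
r = pearson_r(xs, ys)           # below 1 because the relation is not linear
```

The cubic example shows the difference in practice: the ranks of the two variables coincide, while the raw values do not lie on a straight line.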

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
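
The pair-counting definition translates directly into a short sketch. It implements the formula exactly as stated (often called tau-a), where tied pairs count as neither concordant nor discordant. Statistical libraries commonly default to a tie-corrected variant (tau-b), so values can differ from this sketch when the data contain ties; the function name is hypothetical.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = pairs = 0
    for (x1, y1), (x2, y2) in combinations(list(zip(xs, ys)), 2):
        pairs += 1
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / pairs


tau = kendall_tau_a([1, 2, 3], [1, 3, 2])  # two concordant pairs, one discordant
```

The double loop is quadratic in the number of observations, which is fine for illustration but slow for a dataset of this size.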

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
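
A minimal sketch of a bias-corrected Cramér's V follows, using the correction as it is commonly written: φ² is reduced by (k-1)(r-1)/(n-1) and the numbers of rows r and columns k are shrunk before normalising. Whether the report's implementation matches this in every detail is an assumption, and the function name is hypothetical.

```python
import math
from collections import Counter


def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # a variable with a single category carries no association
    return math.sqrt(phi2_corr / denom)


independent = cramers_v_corrected(list("aabb"), list("xyxy"))
perfect = cramers_v_corrected(["a"] * 50 + ["b"] * 50, ["x"] * 50 + ["y"] * 50)
```

In the two toy cases, the balanced independent table gives 0 and the one-to-one mapping gives 1, matching the range described above.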

Missing values

Sample

First rows

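index | REF_NO | children | age_band | status | occupation | home_status | family_income | self_employed | year_last_moved | Average Credit Card Transaction | Balance Transfer | Term Deposit | Life Insurance | Medical Insurance | Average A/C Balance | Personal Loan | Investment in Mutual Fund | Investment Tax Saving Bond | Home Loan | Online Purchase Amount | Revenue Grid | gender | Investment in Commudity | Investment in Equity | Investment in Derivative | Portfolio Balance
0 | 1 | 0 | 2 | 0 | 0 | 1 | 16.0 | 0 | 1972 | 148.44 | 142.95 | 0.00 | 81.96 | 0.00 | 29.99 | 0.00 | 61.95 | 19.99 | 0.00 | 0.00 | 1 | 0 | 74.67 | 18.66 | 32.32 | 89.43
1 | 2 | 0 | 2 | 1 | 1 | 1 | 26.0 | 0 | 1998 | 0.00 | 74.98 | 0.00 | 25.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 0 | 20.19 | 0.00 | 4.33 | 22.78
2 | 3 | 0 | 0 | 1 | 0 | 1 | 28.5 | 1 | 1996 | 0.00 | 166.44 | 20.99 | 291.37 | 11.48 | 166.94 | 0.00 | 15.99 | 0.00 | 3.49 | 0.00 | 2 | 1 | 98.06 | 31.07 | 80.96 | 171.78
3 | 5 | 0 | 0 | 1 | 0 | 1 | 13.5 | 0 | 1997 | 0.00 | 0.00 | 0.00 | 20.49 | 0.00 | 39.46 | 0.00 | 45.44 | 0.00 | 0.00 | 0.00 | 2 | 0 | 4.10 | 14.15 | 17.57 | -41.70
4 | 6 | 0 | 1 | 0 | 0 | 1 | 28.5 | 0 | 1995 | 73.45 | 57.96 | 0.00 | 177.42 | 41.95 | 39.47 | 10.97 | 212.84 | 0.00 | 45.91 | 25.98 | 2 | 0 | 70.16 | 55.86 | 80.44 | 235.02
5 | 7 | 0 | 2 | 0 | 0 | 1 | 16.0 | 0 | 1984 | 0.00 | 125.45 | 0.00 | 129.96 | 0.00 | 0.00 | 0.00 | 88.95 | 0.00 | 0.00 | 0.00 | 2 | 0 | 51.08 | 14.83 | 36.49 | 72.30
6 | 8 | 0 | 1 | 0 | 0 | 1 | 35.0 | 0 | 1985 | 0.00 | 29.48 | 97.42 | 0.02 | 39.45 | 63.97 | 0.00 | 49.42 | 0.00 | 0.00 | 0.00 | 2 | 0 | 33.27 | 18.90 | 25.48 | 115.28
7 | 9 | 1 | 1 | 0 | 0 | 1 | 35.0 | 0 | 1993 | 0.00 | 0.01 | 175.88 | 67.47 | 0.00 | 110.95 | 86.39 | 74.93 | 41.71 | 35.41 | 11.99 | 2 | 0 | 48.67 | 60.23 | 63.58 | 162.40
8 | 10 | 0 | 1 | 1 | 0 | 1 | 16.0 | 0 | 1995 | 25.00 | 0.00 | 76.45 | 24.99 | 0.00 | 87.93 | 0.00 | 253.27 | 12.48 | 7.21 | 8.99 | 2 | 1 | 25.29 | 61.65 | 63.11 | 105.20
9 | 11 | 2 | 1 | 0 | 0 | 1 | 35.0 | 0 | 1993 | 0.00 | 74.98 | 62.98 | 141.91 | 44.48 | 0.00 | 13.99 | 49.95 | 0.00 | 10.49 | 7.98 | 2 | 0 | 64.87 | 13.74 | 41.72 | 93.09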

Last rows

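index | REF_NO | children | age_band | status | occupation | home_status | family_income | self_employed | year_last_moved | Average Credit Card Transaction | Balance Transfer | Term Deposit | Life Insurance | Medical Insurance | Average A/C Balance | Personal Loan | Investment in Mutual Fund | Investment Tax Saving Bond | Home Loan | Online Purchase Amount | Revenue Grid | gender | Investment in Commudity | Investment in Equity | Investment in Derivative | Portfolio Balance
10145 | 11505 | 0 | 2 | 0 | 0 | 1 | 28.50 | 0 | 1976 | 0.00 | 0.00 | 0.00 | 17.47 | 0.00 | 0.00 | 0.00 | 37.47 | 4.99 | 18.72 | 3.49 | 2 | 0 | 3.49 | 10.78 | 9.99 | 46.21
10146 | 11506 | 0 | 1 | 0 | 1 | 1 | 26.00 | 0 | 1969 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 39.49 | 0.00 | 0.00 | 1 | 0 | 0.00 | 6.58 | 6.58 | 46.86
10147 | 11507 | 3 | 1 | 1 | 1 | 1 | 26.00 | 0 | 1994 | 0.00 | 0.00 | 0.00 | 270.84 | 0.00 | 5.48 | 0.00 | 35.97 | 0.00 | 0.00 | 0.00 | 1 | 0 | 54.17 | 6.91 | 52.05 | 138.70
10148 | 11509 | 0 | 2 | 0 | 1 | 1 | 6.25 | 0 | 1978 | 9.99 | 24.49 | 29.48 | 13.99 | 24.99 | 29.99 | 0.00 | 11.99 | 0.00 | 7.49 | 0.00 | 2 | 0 | 20.59 | 8.25 | 13.49 | 61.10
10149 | 11511 | 0 | 2 | 0 | 1 | 1 | 26.00 | 1 | 1982 | 21.98 | 1.03 | 171.90 | 197.32 | 42.44 | 48.33 | 24.98 | 9.99 | 9.30 | 11.97 | 48.98 | 2 | 1 | 86.93 | 25.59 | 55.39 | 170.07
10150 | 11512 | 1 | 1 | 0 | 0 | 1 | 28.50 | 0 | 1972 | 0.00 | 0.00 | 0.00 | 29.97 | 6.49 | 0.00 | 8.99 | 7.49 | 4.49 | 0.00 | 0.00 | 2 | 0 | 7.29 | 3.50 | 9.57 | 10.23
10151 | 11513 | 0 | 1 | 0 | 0 | 0 | 23.50 | 0 | 1988 | 0.00 | 110.95 | 0.00 | 200.41 | 0.00 | 2.99 | 0.00 | 14.99 | 0.00 | 0.00 | 0.00 | 2 | 0 | 62.27 | 3.00 | 36.40 | 102.62
10152 | 11514 | 2 | 1 | 0 | 1 | 1 | 35.00 | 0 | 1992 | 124.93 | 0.00 | 54.48 | 0.00 | 84.42 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 0 | 52.77 | 0.00 | 14.07 | 76.18
10153 | 11516 | 0 | 2 | 0 | 1 | 1 | 9.00 | 0 | 1970 | 0.00 | 35.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 9.98 | 0.00 | 7.98 | 0.00 | 2 | 0 | 7.20 | 2.99 | 1.66 | 4.79
10154 | 11518 | 1 | 1 | 0 | 1 | 1 | 26.00 | 0 | 1996 | 0.00 | 40.46 | 0.00 | 310.24 | 9.98 | 105.96 | 106.94 | 43.46 | 19.49 | 0.00 | 0.00 | 2 | 0 | 72.14 | 45.98 | 99.35 | 227.12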